智能论文笔记

Machine Learning Partners in Criminal Networks

Diego D. Lopes , Bruno R. da Cunha , Alvaro F. Martins , Sebastian Goncalves , Ervin K. Lenzi , Quentin S. Hanley , Matjaz Perc , Haroldo V. Ribeiro

分类：机器学习 | (统计)机器学习

2022-09-07

最近的研究表明，犯罪网络具有复杂的组织结构，但是是否可以用来预测犯罪网络的静态和动态特性。在这里，通过结合图表学习和机器学习方法，我们表明，可以使用政治腐败，警察情报和洗钱网络的结构性特性来恢复缺失的犯罪伙伴关系，区分不同类型的犯罪和法律协会以及预测犯罪分子之间交换的总金额，所有这些都具有出色的准确性。我们还表明，我们的方法可以预期在腐败网络的动态增长过程中，其准确性很高。因此，与在犯罪现场发现的证据类似，我们得出结论，犯罪网络的结构模式具有有关非法活动的重要信息，这使机器学习方法可以预测缺失的信息，甚至预测未来的犯罪行为。

translated by 谷歌翻译

Optimization of Artificial Neural Networks models applied to the identification of images of asteroids' resonant arguments

Valerio Carruba , Safwan Aljbaae , Gabriel Caritá , Rita Cassia Domingos , Bruno Martins

分类：机器学习

2022-07-28

小行星主带通过平均动力和世俗共振的网络越过，这在小行星和行星的基本频率之间具有相当性时发生。传统上，这些对象是通过视觉检查其共鸣论点的时间演变来识别的，它们是小行星和扰动星球的轨道元素的结合。由于在某些情况下，受这些共振影响的小行星人口是数千个的顺序，因此对于人类观察者来说，这已成为一项纳税任务。最近的作品使用卷积神经网络（CNN）模型自动执行此类任务。在这项工作中，我们将此类模型的结果与一些最先进和可公开的CNN体系结构（如VGG，Inception和Resnet）进行了比较。首先使用验证集和一系列正规化技术（例如数据扩展，辍学和批处理标准）进行测试和优化此类模型的性能。然后使用三个最佳模型来预测包含数千张图像的较大测试数据库的标签。事实证明，有和没有正规化的VGG模型是预测大型数据集标签的最有效方法。由于Vera C. Rubin天文台在未来几年内可能会发现多达四百万个新的小行星，因此这些模型的使用可能会非常有价值，以识别共鸣的次要人群。

translated by 谷歌翻译

Scaling Painting Style Transfer

Bruno Galerne , Lara Raad , José Lezama , Jean-Michel Morel

分类：计算机视觉

2022-12-27

Neural style transfer is a deep learning technique that produces an unprecedentedly rich style transfer from a style image to a content image and is particularly impressive when it comes to transferring style from a painting to an image. It was originally achieved by solving an optimization problem to match the global style statistics of the style image while preserving the local geometric features of the content image. The two main drawbacks of this original approach is that it is computationally expensive and that the resolution of the output images is limited by high GPU memory requirements. Many solutions have been proposed to both accelerate neural style transfer and increase its resolution, but they all compromise the quality of the produced images. Indeed, transferring the style of a painting is a complex task involving features at different scales, from the color palette and compositional style to the fine brushstrokes and texture of the canvas. This paper provides a solution to solve the original global optimization for ultra-high resolution images, enabling multiscale style transfer at unprecedented image sizes. This is achieved by spatially localizing the computation of each forward and backward passes through the VGG network. Extensive qualitative and quantitative comparisons show that our method produces a style transfer of unmatched quality for such high resolution painting styles.

translated by 谷歌翻译

The State of the Art in Enhancing Trust in Machine Learning Models with the Use of Visualizations

A. Chatzimparmpas , R. Martins , I. Jusufi , K. Kucher , Fabrice Rossi , A. Kerren

分类：机器学习 | (统计)机器学习

2022-12-22

Machine learning (ML) models are nowadays used in complex applications in various domains, such as medicine, bioinformatics, and other sciences. Due to their black box nature, however, it may sometimes be hard to understand and trust the results they provide. This has increased the demand for reliable visualization tools related to enhancing trust in ML models, which has become a prominent topic of research in the visualization community over the past decades. To provide an overview and present the frontiers of current research on the topic, we present a State-of-the-Art Report (STAR) on enhancing trust in ML models with the use of interactive visualization. We define and describe the background of the topic, introduce a categorization for visualization techniques that aim to accomplish this goal, and discuss insights and opportunities for future research directions. Among our contributions is a categorization of trust against different facets of interactive ML, expanded and improved from previous research. Our results are investigated from different analytical perspectives: (a) providing a statistical overview, (b) summarizing key findings, (c) performing topic analyses, and (d) exploring the data sets used in the individual papers, all with the support of an interactive web-based survey browser. We intend this survey to be beneficial for visualization researchers whose interests involve making ML models more trustworthy, as well as researchers and practitioners from other disciplines in their search for effective visualization techniques suitable for solving their tasks with confidence and conveying meaning to their data.

translated by 谷歌翻译

Asking Clarification Questions for Code Generation in General-Purpose Programming Language

Haau-Sing Li , Mohsen Mesgar , André F. T. Martins , Iryna Gurevych

分类：自然语言处理

2022-12-19

Code generation from text requires understanding the user's intent from a natural language description (NLD) and generating an executable program code snippet that satisfies this intent. While recent pretrained language models (PLMs) demonstrate remarkable performance for this task, these models fail when the given NLD is ambiguous due to the lack of enough specifications for generating a high-quality code snippet. In this work, we introduce a novel and more realistic setup for this task. We hypothesize that ambiguities in the specifications of an NLD are resolved by asking clarification questions (CQs). Therefore, we collect and introduce a new dataset named CodeClarQA containing NLD-Code pairs with created CQAs. We evaluate the performance of PLMs for code generation on our dataset. The empirical results support our hypothesis that clarifications result in more precise generated code, as shown by an improvement of 17.52 in BLEU, 12.72 in CodeBLEU, and 7.7\% in the exact match. Alongside this, our task and dataset introduce new challenges to the community, including when and what CQs should be asked.

translated by 谷歌翻译

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

Nuno M. Guerreiro , Pierre Colombo , Pablo Piantanida , André F. T. Martins

分类：自然语言处理 | 机器学习

2022-12-19

Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit encoder-decoder attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ large models trained on millions of samples.

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

Towards fully automated deep-learning-based brain tumor segmentation: is brain extraction still necessary?

Bruno Machado Pacheco , Guilherme de Souza e Cassia , Danilo Silva

分类：计算机视觉

2022-12-14

State-of-the-art brain tumor segmentation is based on deep learning models applied to multi-modal MRIs. Currently, these models are trained on images after a preprocessing stage that involves registration, interpolation, brain extraction (BE, also known as skull-stripping) and manual correction by an expert. However, for clinical practice, this last step is tedious and time-consuming and, therefore, not always feasible, resulting in skull-stripping faults that can negatively impact the tumor segmentation quality. Still, the extent of this impact has never been measured for any of the many different BE methods available. In this work, we propose an automatic brain tumor segmentation pipeline and evaluate its performance with multiple BE methods. Our experiments show that the choice of a BE method can compromise up to 15.7% of the tumor segmentation performance. Moreover, we propose training and testing tumor segmentation models on non-skull-stripped images, effectively discarding the BE step from the pipeline. Our results show that this approach leads to a competitive performance at a fraction of the time. We conclude that, in contrast to the current paradigm, training tumor segmentation models on non-skull-stripped images can be the best option when high performance in clinical practice is desired.

translated by 谷歌翻译

Explaining Agent's Decision-making in a Hierarchical Reinforcement Learning Scenario

Hugo Muñoz , Ernesto Portugal , Angel Ayala , Bruno Fernandes , Francisco Cruz

分类：人工智能 | 机器学习

2022-12-14

Reinforcement learning is a machine learning approach based on behavioral psychology. It is focused on learning agents that can acquire knowledge and learn to carry out new tasks by interacting with the environment. However, a problem occurs when reinforcement learning is used in critical contexts where the users of the system need to have more information and reliability for the actions executed by an agent. In this regard, explainable reinforcement learning seeks to provide to an agent in training with methods in order to explain its behavior in such a way that users with no experience in machine learning could understand the agent's behavior. One of these is the memory-based explainable reinforcement learning method that is used to compute probabilities of success for each state-action pair using an episodic memory. In this work, we propose to make use of the memory-based explainable reinforcement learning method in a hierarchical environment composed of sub-tasks that need to be first addressed to solve a more complex task. The end goal is to verify if it is possible to provide to the agent the ability to explain its actions in the global task as well as in the sub-tasks. The results obtained showed that it is possible to use the memory-based method in hierarchical environments with high-level tasks and compute the probabilities of success to be used as a basis for explaining the agent's behavior.

translated by 谷歌翻译

Efficient Optimization with Higher-Order Ising Machines

Connor Bybee , Denis Kleyko , Dmitri E. Nikonov , Amir Khosrowshahi , Bruno A. Olshausen , Friedrich T. Sommer

分类：神经与进化计算

2022-12-07

A prominent approach to solving combinatorial optimization problems on parallel hardware is Ising machines, i.e., hardware implementations of networks of interacting binary spin variables. Most Ising machines leverage second-order interactions although important classes of optimization problems, such as satisfiability problems, map more seamlessly to Ising networks with higher-order interactions. Here, we demonstrate that higher-order Ising machines can solve satisfiability problems more resource-efficiently in terms of the number of spin variables and their connections when compared to traditional second-order Ising machines. Further, our results show on a benchmark dataset of Boolean \textit{k}-satisfiability problems that higher-order Ising machines implemented with coupled oscillators rapidly find solutions that are better than second-order Ising machines, thus, improving the current state-of-the-art for Ising machines.

translated by 谷歌翻译